Parameter tying for flexible speech recognition

نویسندگان

Jacques Simonin

S. Bodin

Denis Jouvet

Katarina Bartkova

چکیده

This paper presents two parameter tying techniques which enable a trade-off between computational cost and recognition performances of a speaker independent flexible speech recognition system working over the telephone network. Parameter tying is conducted at phonetic and acoustic levels. At the phonetic level, allophone and triphone based phonetic modeling are used simultaneously to achieve the best trade-off between computational cost and recognition performances. This decreases error rate with a controlled computational cost as compared to an allophone modeling. At the acoustic level, the tying is performed by clustering the Gaussian densities of mixture distributions. After clustering, a particular density may be use by several distribution. This allows the total number of Gaussian densities to be divided by two while improving the recognition performances.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improved Bayesian Training for Context-Dependent Modeling in Continuous Persian Speech Recognition

Context-dependent modeling is a widely used technique for better phone modeling in continuous speech recognition. While different types of context-dependent models have been used, triphones have been known as the most effective ones. In this paper, a Maximum a Posteriori (MAP) estimation approach has been used to estimate the parameters of the untied triphone model set used in data-driven clust...

متن کامل

Flexible Parameter Tying for Conversational Speech Recognition

Modeling pronunciation variation is key for recognizing conversational speech. Previous efforts on pronunciation modeling by modifying dictionaries only yielded marginal improvement. Due to complex interaction between dictionaries and acoustic models, we believe a pronunciation modeling scheme is plausible only when closely coupled with the underlying acoustic model. This paper explores the use...

متن کامل

Recognizing Sloppy Speech

As speech recognition moves from labs into the real world, the sloppy speech problem emerges as a major challenge. Sloppy speech, or conversational speech, refers to the speaking style people typically use in daily conversations. The recognition error rate for sloppy speech has been found to double that of read speech in many circumstances. Previous work on sloppy speech has focused on modeling...

متن کامل

Large Vocabulary Continuous Speech Recognition: Improvements in Acoustic Modelling and Search

This paper describes the main improvements we made in two of the basic modules in our HMMbased large vocabulary speaker independent continuous speech recognition system: namely in the acoustic modelling and in the search engine. For the acoustic modelling, we paid special attention both to improved parameter tying at the density and at the state level, and to fast evaluation of the HMMs. For th...

متن کامل

Smoothing and tying for Korean flexible vocabulary isolated word recognition

For large vocabulary recognition system, as well as for flexible vocabulary applications using hidden Markov model(HMM), parameter smoothing and tying have been used to increase the reliability of models. This paper describes bottom-up and topdown clustering techniques for state level tying. This paper also describes a method of applying parameter smoothing to the clustered states and covarianc...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1996

Parameter tying for flexible speech recognition

نویسندگان

چکیده

منابع مشابه

Improved Bayesian Training for Context-Dependent Modeling in Continuous Persian Speech Recognition

Flexible Parameter Tying for Conversational Speech Recognition

Recognizing Sloppy Speech

Large Vocabulary Continuous Speech Recognition: Improvements in Acoustic Modelling and Search

Smoothing and tying for Korean flexible vocabulary isolated word recognition

عنوان ژورنال:

اشتراک گذاری